Rank in Wordlist | Frequency | Word |
---|---|---|
6926 | 72 | 1,000 |
6989 | 71 | 10,000 |
7401 | 65 | 100,000 |
8393 | 53 | 20,000 |
8705 | 50 | 50,000 |
8795 | 49 | 3,000 |
9400 | 44 | 2,000 |
9676 | 42 | 5,000 |
11747 | 31 | 4,000 |
11963 | 30 | Accept-Encoding,User-Agent |
Rank in Wordlist | Frequency | Word |
---|---|---|
8454 | 53 | off)(Nevermind |
15842 | 19 | Keyword(s |
18051 | 15 | A(H1N1 |
21681 | 11 | 32(0)2 |
21682 | 11 | 33(0)3 |
22697 | 11 | partner(s |
24447 | 9 | 21(1 |
25609 | 9 | language(s |
28215 | 7 | 10(1 |
28315 | 7 | A(H7N9 |
Rank in Wordlist | Frequency | Word |
---|---|---|
8454 | 53 | off)(Nevermind |
13699 | 24 | %) |
21658 | 11 | %). |
21681 | 11 | 32(0)2 |
21682 | 11 | 33(0)3 |
30851 | 6 | 0)2 |
34164 | 5 | 0)My |
38754 | 4 | %), |
41815 | 4 | SMEST)Examples |
42255 | 4 | U)SIM |
Rank in Wordlist | Frequency | Word |
---|---|---|
2846 | 247 | 20% |
3159 | 216 | 50% |
3185 | 214 | 100% |
3844 | 166 | 10% |
4065 | 155 | 40% |
4236 | 147 | 30% |
4676 | 130 | 80% |
5306 | 108 | 90% |
5571 | 101 | 60% |
5611 | 100 | 25% |
Rank in Wordlist | Frequency | Word |
---|---|---|
1213 | 628 | R&D |
7491 | 64 | S&D |
7873 | 59 | S&T |
11594 | 32 | Q&A |
14187 | 23 | R&I |
24858 | 9 | M&A |
25010 | 9 | R&D&I |
26157 | 8 | AT&T |
28621 | 7 | ECA&D |
28987 | 7 | M&E |
Rank in Wordlist | Frequency | Word |
---|---|---|
9287 | 45 | $1 |
16223 | 18 | $100 |
19605 | 13 | $10 |
21007 | 12 | US$ |
21657 | 11 | $2 |
22989 | 10 | $5 |
24395 | 9 | $20 |
24396 | 9 | $300 |
26051 | 8 | $200 |
26052 | 8 | $500 |
Rank in Wordlist | Frequency | Word |
---|---|---|
82650 | 1 | %" |
Rank in Wordlist | Frequency | Word |
---|---|---|
587 | 1214 | it's |
1010 | 744 | It's |
1093 | 691 | don't |
1335 | 573 | that's |
1369 | 559 | they're |
1416 | 541 | That's |
1605 | 474 | I'm |
1959 | 388 | you're |
2024 | 370 | EU's |
2453 | 296 | Europe's |
Rank in Wordlist | Frequency | Word |
---|---|---|
1771 | 431 | and/or |
7312 | 67 | his/her |
8428 | 53 | companies/no |
8482 | 52 | Greens/EFA |
8798 | 49 | 874/2004 |
9402 | 44 | 9/11 |
9873 | 41 | GUE/NGL |
10338 | 38 | 24/7 |
13681 | 25 | to/from |
14512 | 22 | HIV/AIDS |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $
" ' + * = / _
Depending on the language some of these characters may be allowed within words, other will not. If words with forbidden characters do not have very low frequency there might be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots